Anthropic Open-Sources Petri — Automated Framework to Audit LLM Behavior at Scale
Anthropic has open-sourced Petri, a tool that automates alignment audits by orchestrating auditor and judge agents to probe LLMs across multi-turn, tool-augmented scenarios. In a 14-model pilot, it revealed misaligned behaviors.